Sequence logos: a new way to display consensus sequences.
Abstract
A graphical method is presented for displaying the patterns in a set of aligned sequences. The characters representing the sequence are stacked on top of each other for each position in the aligned sequences. The height of each letter is made proportional to its frequency, and the letters are sorted so the most common one is on top. The height of the entire stack is then adjusted to signify the information content of the sequences at that position. From these 'sequence logos', one can determine not only the consensus sequence but also the relative frequency of bases and the information content (measured in bits) at every position in a site or sequence. The logo displays both significant residues and subtle sequence patterns.
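The stacking rule described in the abstract can be illustrated with a short Python sketch. It is not the authors' implementation. The function name `logo_stacks`, the DNA alphabet `"ACGT"`, and the example alignment are hypothetical. The sketch also assumes that the information content at a position is the maximum possible entropy (2 bits for four bases) minus the observed Shannon entropy, with no small-sample correction; the abstract does not give the exact formula.

```python
import math
from collections import Counter


def logo_stacks(aligned, alphabet="ACGT"):
    """Compute letter stacks for a sequence logo (illustrative sketch).

    Returns one list per alignment column. Each list holds
    (letter, height_in_bits) pairs ordered bottom to top, so the most
    frequent letter is last, i.e. on top of the stack.
    """
    if not aligned:
        raise ValueError("need at least one sequence")
    length = len(aligned[0])
    if any(len(seq) != length for seq in aligned):
        raise ValueError("sequences must be aligned to the same length")

    max_bits = math.log2(len(alphabet))  # 2 bits for a 4-letter alphabet
    stacks = []
    for col in range(length):
        counts = Counter(seq[col] for seq in aligned)
        total = sum(counts[a] for a in alphabet)  # ignores gaps and unknowns
        if total == 0:
            stacks.append([])
            continue
        freqs = {a: counts[a] / total for a in alphabet if counts[a]}
        # Shannon entropy of the column, in bits.
        entropy = -sum(f * math.log2(f) for f in freqs.values())
        # Assumption: information content is maximum entropy minus observed
        # entropy, with no small-sample correction.
        info = max_bits - entropy
        # Each letter gets a share of the stack proportional to its frequency.
        stack = sorted(((a, f * info) for a, f in freqs.items()),
                       key=lambda pair: (pair[1], pair[0]))
        stacks.append(stack)
    return stacks


sites = ["ACGT", "ACGA", "ACTA", "AGTA"]  # made-up example alignment
for position, stack in enumerate(logo_stacks(sites), start=1):
    print(position, [(letter, round(height, 3)) for letter, height in stack])
```

In this sketch a fully conserved column yields a single letter 2 bits tall, while a column split evenly between two bases yields a 1-bit stack shared equally by both letters. Drawing the letters themselves is left to a plotting library.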
Similar resources
Visualizing bacterial tRNA identity determinants and antideterminants using function logos and inverse function logos
Sequence logos are stacked bar graphs that generalize the notion of consensus sequence. They employ entropy statistics very effectively to display variation in a structural alignment of sequences of a common function, while emphasizing its over-represented features. Yet sequence logos cannot display features that distinguish functional subclasses within a structurally related superfamily nor do...
enoLOGOS: a versatile web tool for energy normalized sequence logos
enoLOGOS is a web-based tool that generates sequence logos from various input sources. Sequence logos have become a popular way to graphically represent DNA and amino acid sequence patterns from a set of aligned sequences. Each position of the alignment is represented by a column of stacked symbols with its total height reflecting the information content in this position. Currently, the availab...
A simulated annealing algorithm for finding consensus sequences
MOTIVATION: A consensus sequence for a family of related sequences is, as the name suggests, a sequence that captures the features common to most members of the family. Consensus sequences are important in various DNA sequencing applications and are a convenient way to characterize a family of molecules. RESULTS: This paper describes a new algorithm for finding a consensus sequence, using the p...
RNALogo: a new approach to display structural RNA alignment
Regulatory RNAs play essential roles in many essential biological processes, ranging from gene regulation to protein synthesis. This work presents a web-based tool, RNALogo, to create a new graphical representation of the patterns in a multiple RNA sequence alignment with a consensus structure. The RNALogo graph can indicate significant features within an RNA sequence alignment and its consensu...
On Base-Pairing Potential Between 16S rRNA and 5’ UTR in Archaebacterial Genomes
The Shine-Dalgarno (SD) sequence [4] of E. coli is known to be a signal to initiate translation. The widely accepted model is that the 3’ end of 16S rRNA base-pairs with the SD sequence in the first step of ribosome binding to mRNA. However, archaebacteria have been supposed to have systems of translation different from those of eubacteria and eucaryotes. Further, some eubacteria, such as M. ge...
Journal: Nucleic Acids Research
Volume 18, Issue 20
Publication date: 1990